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ASYMPTOTIC  FORMULAE  FOR  THE  DISTRIBUTION  OF 


HOTELLING'S  GENERALIZED  STATISTIC 
By  Koichi  Ito 

University  of  North  Carolina 


1 . Summary. 

In  this  paper  the  asymptotic  expansion  of  a percentage  point  of  Hotelling's 
2 

generalized  Tq  distribution  is  derived  in  terms  of  the  corresponding  percentage 

2 

point  of  a X distribution.  Our  result  generalizes  Hotelling's  and  Frankel's  asymp- 
totic expansion  for  the  generalized  Student  T £2 _J f £^>_J  • The  technique  used  in 

2 

this  paper  for  obtaining  the  asymptotic  expansion  of  T^  is  an  extension  of  the  pre- 
vious methods  of  Welch  £l_/  and  of  James  £^J,  /~5_7>  wh°  used  them  to  solve  the 
distribution  problem  of  various  statistics  in  connection  wih  the  Behrens-Fisher  pro- 
blem. An  asymptotic  formula  for  the  cumulative  distribution  function  (c.d.f  ) of 

2 

Tq  is  also  given  together  with  an  upper  bound  for  the  error  committed  when  axl  but 

I 

the  first  few  terms  are  omitted  in  the  series.  This  formula  is  a sort  of  multivari- 
ate analogue  of  Hartley’s  formula  of  " Student izat ion"  £l £J . 

2.  Introduction . 

In  the  multivariate  analysis  of  variance  we  use  the  following  canonical  proba- 
bility law:  | 

I 

(2.1)  P(XQ,  Xj  = const.  exP/‘4tr  A ^ - |)  (X^  - % ' ) -|tr  A XqXq  JdX^, 

1 • 

where  X^  and  XQ  are  pxm  and  pxn  matrices  respectively,  and  — X^X^  = S ^ is  the  sample 

1 i 

"between"  dispersion  matrix  and  — X^X^  = Sq  is  the  sample  "within"  dispersion  matrix, 

the  prime  denoting  the  transpose  of  a matrix.  | is  a p x m matrix,  ~ being  the 

population  "between"  dispersion  matrix,  and  A is  a p x p symmetric  positive  defi- 
nite matrix.  It  is  assumed  that  m may  be  > p or  < p,  but  n > p.  To  test  the  null 

1 Work  at  Chapel  Hill  sponsored  by  the  Office  of  Naval  Research  under  Contract 
NR  04  2 031.  Reproduction  in  whole  or  in  part  is  permitted  for  any  purpose  of 
the  United  States  government. 
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hypothesis  H^:  | = 0,  H telling  /_2_7  proposed  a test  based  on  the  statistic: 

(2.2)  T2  = m tr  S.S!1 

0 1 o 

and  derived  the  exact  distribution  of  this  statistic  when  p = 2 and  | = 0.  For  gen- 

2 

eral  values  of  p the  exact  distribution  of  is  not  available  at  present,  even 
in  the  null  case  | = 0. 

2 

3.  Derivation  of  asymptotic  formula  of  Tq. 

For  general  values  of  p it  is  known  that  the  statistic 

(3.1)  X2  = b tr  Sj  ^ 

2 

has  a X distribution  with  m p degrees  of  freedom.  That  is  to  say,  we  have 

(3.2)  Pr  (m  tr  A < 29  } = Gp(0)  , 

2 

where  29  denotes  the  tabled  value  of  X for  a particular  level  of  significance, 
p = mp/2,  and 

e 

(3.3)  Gp(e)  = /"r( P27'1  J tp-1  e_t  dt. 

o 

Hence,  if  /\  is  known,  the  statistic  X given  by  (3.1)  may  be  used  to  test  ex- 
actly, and  if  /\  is  unknown  but  if  Sq  is  based  on  a large  number  of  degrees  of 
freedom,  i.e.,  if  n is  large,  we  may  use  as  an  approximation  the  result: 

(3.4)  Pr  jm  tr  S.^"1  < 29  \ = !p(e). 

This  suggests  that  in  the  general  case  we  try  to  find  a function  h(S^)  of  the  ele- 
ments of  Sq  such  that 

Pr  jm  tr  S^'1  < 2h(SQ)  | = Gp(©)  . 


(3.5) 
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2 

when  n is  large,  2h(.SQ)  will  approach  20  = X , and  we  now  expect  to  write 

2h(S^)  as  a series  with  X as  its  first  term  and  successive  terms  of  decreasing 

order  of  magnitude. 

Now 

(3.6)  Pr  (m  tr  S^"1  < 2h(Sc  ) j = f Pr  j m tr  S^S'1  < 2h(S0)  |S0  j Pr  j d SQj  , 

R 


where  the  first  expression  on  the  right  denotes  the  conditional  probability  of  the 
relation  indicated  for  fixed  values  of  the  elements  of  S^,  and  the  second  denotes 
the  probability  element  of  S^,  which  has  a Wishart  distribution 

with  n degrees  of  freedom,  and  the  domain  of  integration  R is  over  all  possible 
values  of  the  elements  of  Sq.  Now  we  may  expand  Pr^m  tr  S^S"  < 2h(SQ)  |SQ  j about 

an  origin  (a  , o , a , o , o ) in  a Taylor  series,  where 

-LX  c.d  pp  d.  p~X^p 


= ^exp/"tr(S0  - A _1)6_7 j Pr  jm  tr  S^A  < 2h(  A "1)J  , 


where  e . , 
oij 


is  the  i-th  row, 


j-th  ^column  element  of  S^,  and  c>  denotes  the  matrix  of 


derivative  operators: 


I 


(3.8) 


- k - 


hA- 


1 8 ! d 

^a~ 
pl  p2 


; a 
*3% 


its  typical  element  being  d.  . = 4(1  + S.  ,)  •>— — , where  5 . . is  the  Kronecker  delta. 

ij  ^ ij  da.  ij 

^ J 


Whether  uniformly  convergent  or  not,  the  right  hand  side  of  (3-7)  is  an  asymptotic 

representation  of  Pr  ‘m  tr  S.,S"^  < 2h(S,_)  Is.  , for  sufficiently  large  values  of  n. 

I 10—  o 0 ,, 

Hence,  substitution  of  (3.7)  into  (3-6)  and  term  by  term  integration  which  may  be 
done  legitimately,  yields: 

(3.9)  Gp(e)  = ( exp/'tr(S0- A_1)8_7  Pr  jm  tr  S^A  < 2h(  A”1)]  Pr  jd  SQj 


= 0 Pr  * m tr  S0  < 2h(  A"1)  j , 


where 


0 = j exp/”tr(S0  -A”x)8 J Pr  /d  SQi  . 


Since  SQ  has  a Wishart  distribution  with  n degrees  of  freedom,  we  have 

n n-p-1 

0 = exp  /'~tr  A_18_7'const.|  A |2  j |SQ|  c exp/  tr  (s0  d - 75  As  Q)J  d SQ 

A 

n - n-p-1 

= exp  /"-tr  A_18_7 * const.  |A|2  j |SQ|  c exp/"-  | tr(  A - ~ d)S d SQ 

~R 

— n 

= exp  /"-  tr  A'1  8_7-  | A |2  | A - ! df  2 

n 

= exp  /"-tr  A_1  8_7  • | I - | A"1  8 1 2 , 


- 5 - 


where  I is  the  p x p identity  matrix.  Now  using  £*> _J , 

(3.10) 


-log  |l  - Y|  = tr  Y + ^tr  Y2  + i tr  Y3  + ..., 


ootain 


(3.11) 


0 = exp  trA'1^  - | log  | I - ^ A"-^  I _7 


-1, 


= exp  tr  /\~±&  + | ^tr(|A1d)+-|tr(|/"1a)2+itr(^A~1S)+...J  J 

= exp  £ - tr(A-1S)‘1  + — ^ tr  ( A"1^)5  + • • «_7 

3n 


1 . , A-li\2^  1 1 A , a-A\3  l/*  / A-1  + 0(n”3) 

- tr ( A d)  + — p { -tr(  A d)A--(tr(A  o)  ) J v 


= 1+ 

n " ' ' n£  ' ^ ' * ' 

It  is  to  be  noted  h-.ru  that  in  (3.11)  the  operator  d does  not  act  on  A_1  Present  in 
0)  itself,  and  it  is  more  useful  for  our  purpose  to  write  (3.11)  in  suffix  form: 

(3.12)  0 = 1 + £ ^ 


a o.  d , d 
rs  tu  st  ur 


+ ^1^0  a a & b d + is  a a a a d . d S d 'l, 
£ l 3 rs  tu  vw  st  uv  wr  2 rs  tu  vw  xy  st  ur  wx  yvj 


+ 0(n-3), 

where  1 denotes  the  summation  over  all  suffixes  r,  s,  ...  each  of  which  ranges  from 
1 to  p. 

Now  we  represent  h(S^)  as 


(3.13) 


b(SQ)  =0+  h1(SQ)  + hr(S0)  + ..., 


hs(SQ)  being  of  order  n“3,  i.e.,  we  write  h(S0)  as  an  asymptotic  series  such  that 

|nS  [h(SQ)  - 9 - h1(S0) VV)  1 

is  made  arbitrarily  small  for  sufficiently  large  values  of  n.  Then  (3.13)  may  be 
substituted  into  Pr  J u tr  S,  A < 2h(  A l , and  by  Taylor's  expansion  we  have: 

L 1 ~ -» 
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O.lM  Pr  wn  tr  ^ A < 2h(  A"1)  j 

= exp/'  | h1(  A'1)  + h2(A'1)  + ...^D_7  Pr  |m  tr  S± /\  < 20  j 

2 

= /'  1+  Jh1(  A'1)  + h2(  A'1)  + ...  J D + I (h1(  A-1)+b2(  D £+..._7 

x Pr  tr  A < 20  j , 

vhere  D = -A  . By  substituting  (3.12)  and  (3.1*0  into  (3*9)  we  obtain 


(3.15)  G («)  = /'I  + - £ o jAA  + i 

pv  *-  n rs  tu  st  ur  2 3 

n '• 


o a o A .A  d 
rs  tu  vw  st  uv  wr 


+ ~£  o c a a d d d A L + 0(n~O  J 
2 rs  tu  vw  xy  st  ur  vx  y vj  — 

x /~1+  h1(  A_1)D  + Jh£(  A"1)  + |h^(  A"1)  D2}  + 0(n"5)_7 

x Pr  | id  tr  A < 2©  1,  . 


By  eq  .-ting  terms  of  successive  order  in  (3.15)  we  obtain 

(3.16)  K<  A’V*  i.  °rs"tuaBtiur  j Pr  {»  tr  Sj  A < E6>  } - 0, 


(3.17)  /h2(  A-bo  ♦ Jh|(  A'1)!)2  . i.  °rs«tu  ( A'1)  D 

. A”*  ‘A'1)  V>  + 3stV) 


2.  o a o d o d + — baaaaS.ddd  7 
rs  tu  w st  uv  wr  „ 2 rs  tu  vw  xy  st  ur  vx  yv-' 

2n 


x Pr  j m tr  S1 A < 28 j =0, 

and  so  on,  where  h!jst^(  A"1)  = A"1)  and  h^st,  ur^  (A”1)  = ^ur^st  hi(A-1)' 


+ . 
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It  now  remains  to  carry  out  the  operations  8 and  D indicated  in  (J.l6)  and 
(3-17)in  order  to  obtain  h^/V-1),  hg(  /\  1)  and  hence  ^(Sq^  kg(S  ).  ^he6e  opera- 
tors  will  operate  on  Pr|m  tr  S^A  < 29  j which  is  a p x in  fold  integral,  and  the 
operations  may  be  thought  of  as  differentiations,  with  respect  to  the  boundary  only, 
of  the  integral  of  the  probability  density  function  of  the  X^  throughout  a region  in 
the  space  of  X^.  The  method  used  to  evaluate  Pr  |m  tr  A < 20 j , 

^st^uv^wr  Pr  ) m tr  /\  < £9  | , . ..,  is  to  change  the  boundary  slightly,  expand  the 

integral  in  powers  of  the  quantities  specifying  this  change,  and  obtain  the  deriva- 
tives by  comparison  with  Taylor's  expansion.  We  consider 


(3.18) 


J = Pr  [m  tr  A-1  + e)"1  < 29 J , 


where  € is  a p x p symmetric  matrix.  Then  by  Taylor  expansion  we  have 


(3.19) 


J » ( 1 + L e 8 + 1 e e 8 8 

(_  rs  rs  21  rs  tu  rs  tu 

+ -rr  1 € 8 8.8  +-^—Le  e e e 8 8^  8 8 

3!  rs  tu  vw  rs  tu  vw  41  rs  tu  vw  xy  rs  tu  vw  > 


+ . . . v Pr  | m tr  A < 2©  J-  . 


On  the  other  hand,  J is,  by  definition,  written  as 


(3.20) 


J = JAi_ 

PE 


exP  r - I tr  A Xx  X'±J  dX1  , 


(2it)c  R' 


where  X.^X|  = m3^,  and  domain  of  integration  R'  ranges  over  all  possible  values  of 

-1  *“1 

the  elements  of  X1  such  that  m tr  S^( A + e)  ^ < 29.  It  is  now  easy  to  show 


that  integration  of  (3.20)  yields 


(3.21) 


jl-D  E|  "2 

J ■ ‘WV  > °0(e>- 


«\ 
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where  is  a diagonal  matrix  which  satisfies 


(3.22) 


Xx(p  x m)  = T(p  x p)  z(p  x m), 
Jr'CA"1**)'1  r*  = I(p), 


and  |r*  A I*  - I(p)  - D , 

* n 

T being  a non-singular  matrix,  and  E is  an  operator  such  that 


EGp(e)  = Vi(e)- 


Now  letting  A = E - 1 and  using  (>.22)  we  have 


|I-D*I  |I-D  -DA| 


U-D^i 


lI-Dq I 


||r' a r - {|r'(  a”1+€)'1t- -|r’A  r]  a| 
l|r’A  r| 


I A - {(  A'^c)'1  - A)a|  ini, 

1 L_  = |i-  jA-^A*^)*1-!  A| 


•“  |l  - x Ai 


where  X - A (A  + O ^ - I.  Hence  (3.21)  becomes 


(3.23) 


J = |I  - X A|  2 Gp(e) 


Now  using  (3.10)again,  we  rewrite  (3.23)  as 


(3.24)  J = exp  i^-  | log  | I - XA|  j Gp(e) 

= exp[l  tr  XA  + ^ tr  X2A2+  | tr  X5A5+  g tr  X^A^  + ...  j Gp(0) 
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2 2 j 

/'I  + 2tr  XA  + [“tr  X2  + (tr  X)S}  A2  + j g tr  X5+^-(tr  X)(tr  X^+j^tr  X)^J 

+ [§  tr  X +I5(tr  x)(tr  x5)  + 52(tr  35x2)2 
+ |^(tr  X)2(tr  X2)  + ^.(tr  X)4  j aV  . .JL7Gp(e;  . 


| 


X can  be  represented  as 

(3.25)  X = A “1(/\_1+<)‘1-I  = A-1(  A"1+  L«rsArs)’1  -1  3 (I+£ers 

* L €rSStu‘v»VArBA)(A^A)(A^A)<A^A  > - 

where  A”g  is  a p x p matrix  obtained  by  operating  drg  on  A,  i.e., 

row,  J-th  column  element,  -5(6  6 +5  .6  ).  Writing 

c.  n 8 j si  rj 


A-1A)"1 


I 


AXA^AMA^A) 


• • • ; 


has  its  i-th 


tr(A-1A)  = (rs), 

rs 

tr(  A“g  A)(  A'^A)  = (rs  |tu) , 

tr(  A rs  A ) ^ A tuA  )(  A vw  A ) = (rs|tu|vw), 

tr(A ‘g  A )(A^A)(A^A)(A‘y  A)  *=  (rs  |tu|vw  |xy). 


and  substituting  (3.25)  into  (J.2U),  we  obtain 

2 

(3.26)  J = /"l+^rs  [-|(rs)A}  +|y  £ers€tu  [ (ra  itu)(nA+|A2)+^-(rs)(tu)A2) 
+jrl€rs€tuevw  | (rs  |tu|vw)(-5mA-3mA2-mA5)+(rs)(tu|vw)(-|n2A2-^ii2A5)-^-(rs)(tu)(vw)A^j 

+¥TIf  rs  £tu  evw  £xy  \ (rs  ltu  iw  Uy  ) ( 12irA+  l8rcA2+  12nA^+  3mA4 ) 

♦ (rs  ) (tu  |w  |xy)  (6mc’A2+6m2A^+£m2A4)  + (rs  |tu)  (vw  jxy  ) ( 3ie2A^+ 3ni2A^+-£m2A4 ) 
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j ^ 4 ■) 

+ (rs)  (tu) (vw  |xy) (-|m^A^+£m^A4)  + (re)  (tu)  (vw ) (xy)  ^ A | + . . Gp(0). 

Then  term  by  term  comparison  between  two  expansions  for  J,  (3*19)  and  (3.26),  gives 
c^s  Pr  ( m tr  S1 A < 2©  j , ^r6^tu  Pr  { m tr  A < 26  j , etc.,  but  in  doing  so  we  must 

take  such  a care  that,  for  example, 

£ a.  e .e  .e,  = £ b.  e .£  ,£. 

ijk  i j k ijk  i j k 

implies  a = b if  both  a.  and  b.  are  completely  symmetrical  in  their  suf- 
IjK  IJK  IJK 

fices.  With  this  in  mind  and  using  the  relation 

A Gp(6)  = - E gp(e), 

where  gp(©)  * D Gp(©),  we  obtain 

(3.27)  ^r8  Pr  [m  tr  S1A  < 2©  j = |(rs)  Egp(0), 

2 

(3.28)  Srg9tu  Pr  {m  tr  < 29  j = - jj(rs  | tu)(E?  + E>*  y(rs ) (tu)  (E2-E)]  6p(e) 

(3-29)  ^rs^tu^vw  Pr  [m  tr  Si  A < 29  j = |^m(rs  |tu  |vw ) (E^+E  +E)+5^-  /~(rs)  (tu  |vw) 

+ (tu)(rs  |vw)+(vw)(rs  [tu)_7  (E^-E)+  ^j(rs  ) (tu)  (vw  ) (E^-2E2+E)  j 

' S(Q)' 

(3.30)  ^s&tu\Ay  Pr  |m  tr  S^A  < 26  m/"(rs  |tu  |vw  |xy)+  (rs  |vw  |xy  |tu) 

2 

+ (rs  |xy  |tu  |vw)_7 (E4+E^+E2+E)+  ~/”(rs)  (tu  |vw  |xy)+(xy)  (tu  |w  jrs) 

2 

+ (vw)(tu|xy|rs)+(tu)(vw  |xy|rs)_7  (EU-E)  + ^-/”(rs  |tu) (vw  |xy ) 

+ (rs  |vw)(tu|xy)+(rs  |xy)(tu  |vw)_7(E4+E^-E2-E)+  ^ ^“(rs)(tu)  (vw  |xy) 

+ . '•  ( (tu  jxy)+  (rs ) (xy)  (tu  |vw  )+ (tu) (vw  ) (rs  |xy)+ (tu)  (xy)  (rs  |vw  ) 

+ (w  ) 'yy, ) (rs  |tu)_7(E4-E5-E2+E)+2^.(rs)  (tu)  (vw  ) (xy)  (EU-3E5+3E2-E)j  gp(G)  . 
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Upon  substituting  (3.28)  into  (3.16)  we  obtain 


hl(A'1)  = k Z °rs°tu  Z"an(^lur>l^IT+  |)+  “2(^t)(ur)  - | j J 


Now  (st)  = tr  A I*  A = \ L + * cyBt  ) 


'st''  2 7 , weiutJ  "'trsj' 

1*  J 


= a 


st 


and  also, 


(st  |ur)  = tr  (A‘^A)(A‘^A) 


1.  re  tu  flu  tr . 
■g(0  o+oo) 


Hence  we  have 


and 


E CTrsatu^st lur^  = -P^P^1) 


L °rs°tu(st)(ur)  = p* 


We  also  note  that  20  = X , p = mp/2.  Therefore  we  finally  obtain,  after  some  simpli- 
fication, 

(5.31)  ma-1)  (p—i)x2j  . 

In  a similar  way  we  substitute  (3.29),  (3. 30)  and  (3.51)  into  (3-17)  to  evaluate 
hg(  A-1).  We  note  here  that  since  h^A"1)  given  by  (3.31)  is  independent  of  A-1, 

the  terms  involving  h^8t\  A 1)  and  h|st,ur ^ ( A _1 ) in  (3.17)  do  not  appear.  As  before, 
it  can  be  easily  shown  that 


Z ars°tu°vw(st,uv|wr)  = B P(P2+3P^),  1 arsatuavw(st)(uVlWr) 


= Z °rsatu°w(uv)(stlwr)  = £ 0rs°tu0vw(wr)(stiuv)  = |p(p+l), 

L 0rs0tu°vw(stAuvAwr)  = P>  L °rsatu0vw0xy(st|urlwx,yv) 


£ 0rs0tu°vw0xy(St'WX*yv|ur)  = k P<I*1)2' 
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£0000  (st|yv|ur|wx)  = rp(pO),£  a ° ° ° (st) (ur |wx |yv) 

rs  tu  vw  xy  H rs  111  vw 

= * CTrs0tuaw°xy(jrV)(Ur,Wx|6t)  = **  0r8OtuOvwaxy(WX)(uriyv|6t) 

= £ o o^_  a o (ur)(wx |yv |st)  = ip(p*l)< 
rs  tu  vw  xy'  ^ 


£ 0rsCJtuOvwOxy(st  *Ur)(WX*yv)  = ^(P*-1)^  L ars °tU°vw °xy(st  lwx)  (ur  'yv) 


xy 


* **  arsatu0vwaxy(8t|yV)(Ur,WX)  = ¥^l]’ 


1 2, 


^ arsatuavwDxy(St)(Ur)(Wx|yV)  = * °rs0tu°vwaxy(wx)(yv)(st |ur)  = 2*  (pfl) 


rs  tu  vw  xy 


£0000  (st) (wx) (ur  |yv)  = £ 0 0.  o 0 (st) (yv)(ur Iwx) 

rs  tu  vw  xyv  /v  'v  rs  tu  vw  xy'  w ' 1 

= £ 0 0^  0 0 (ur ) (wx) (st lyv)  = £ o 0 0 0 (ur) (yv)(st  Iwx)  = p. 
rs  tu  vw  xy'  1 rs  tu  vw  xy  w ' 1 


and 


£ o 0.0  0 (st)(ur)(wx)(yv)  = pc 

rs  tu  vw  xy'  ' ' J ^ 


Using  these  results  we  obtain  from  (3.17),  after  some  simplification. 


(3.32)  h (A-1)  = X8 


48n 


(mp+2)  (mp+4) (mp+6) 


. 4mp  +2(3m  + 3ntf-10)p  +2(2m  + 3m  + 17ti+  l8)F*4(5m  + 9ntf-2)  v6 

+ — j \ 

(mp+2)  (mp+4) 

+ -3g  *7  **  + {7p2+{-12mfl2)p^(7mL-iaiH-l)}  X2  J, 

which  is  independent  of  A"P  just  as  h-^A”1). 

Now  we  substitute  (3.51)  and  (3-32)  into  (3.13)  to  obtain 


(3.33)  T2  = 2h(S0)  = 29+  2h1(S0)+2h2(S0)+0(iri)  = X2+  X4+(F-mH)  X2} 

, 1 /6(p-l)(p+2) (m-1 ) (m+  2)  v8.  4mp5+2( 3m2+  3m+  10)p2+  2(2m5+  3m2+  17m+  l8)p+  4(5m2+9m+ 2),/ 

2\f  . a2, 


24n  \ (mp+2) ^ (mp+4)  (mp+6) 

f24P-l 
2 


(mp+2)  (mp+4) 

+ - 1 4 - f?r X4£_I.  x4  + /“ 7P2+  ( - 1 2m+ 12 ) p + (7m2-12m+l)_7x2j  + 0(n-:5), 
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which  is  the  asymptotic  expression  of  a percentage  point  of  the  Tq  distribution  in 

2 

terms  of  the  corresponding  percentage  point  of  the  X distribution  with  mp  degrees 
of  freedom. 

If  we  put  m * 1 in  (3.33) , we  have 

(3.34)  T2  «»  X2  + (x4  + pX2}  + — | 4x6  + (13p  - 2)  X4  + (7p-4)  X2]  ^ 0(n~5), 

2n  > 24n2  i 

which  is  the  asymptotic  expression  of  a percentage  point  of  the  generalized  Student 
T distribution.  This  result  (3.34)  was  previously  obtained  by  Hotelling  an<3  Fran- 
kel  l~2 J,  J‘3J. 

2 

4.  Asymptotic  formula  for  the  c.d.f.  of  Tq. 

Let  F(2©1)  be  the  c.d.f.  of  T2,  i.e., 

(4.1)  F(261)  - Pr  }m  tr  SjSjJ1  <2 © J . 

Then,  as  (3-6),  we  can  write 

(4.2)  Pr  (m  tr  S^”1  < 26x  } * j Ft  |m  tr  S^”1  < 2Q±  | SQ  } Pr  (dSQ} 

R 

= © Pr  [m  tr  SJL  A < 28x  j , 

where  Q is  given  by  (3.12).  Upon  substituting  (3.26),  (3.29)  and  (3«30)  Into 

(4.2)  we  obtain,  after  some  simplification, 

2(pfm+l)e2  . 

(4.3)  F(201)  = Gp(©1)  - ± [-5^2— i + (p-m+l)eij  g^) 

^ 24  { mp^+2(m2+nn-4)p2+ (m^+2m2+21m*-20)p*fim2+2Cto+-20  j 04 

48n2  ^ (mp+-2)(mp*-4)(mp*-6) 

4 | 3mp^-2(3m2-3o-4)p2-3(3m^+2m2+llm-4)p-4Cbi2-3&i-4j 

(mp*2)  (mp*-1 
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2 [3mp3+2(3m2+3m-4)p2-3(3m3-2m2-5D*4)p-8m2+12ii*4}  ®i 

mpf2 

- |3mp3-2(3m2-3m+4)p^+3(m3-2m2+5ni-4)p-8in  + 13n+4^  ©^  — 4 0(n  ^ > 

©1 

where  G^e^)  = (p) _7"’J‘ J tP”^  e ^ dt,  g^(©^)  * p = mp/2, 

0 1 

(4.3)  is  a sort  of  multivariate  analogue  of  Hartley’s  formula  of  "Studentization. 
In  fact  it  can  be  shown  that  when  p * 1,  (4.3)  coincides  with  Hartley’s  formula 
for  the  c.d.f.  of  the  univariate  analysis  of  variance  F statistic.  (See  equation 
(28),  p.  178,  fl J.) 


5.  Discussion  of  the  error  and  remarks. 

In  view  of  the  methods  used  in  sections  3 and  4,  it  is  rather  difficult  to  set 

a bound  for  the  error  committed  by  omitting  all  terms  after  the  first  few  terms  in 

2 

the  asymptotic  formula  for  TQ  (3*33)  or  in  the  asymptotic  formula  for  the  c.d.f. 

2 

cf  Tq  (4.3).  There  is,  however,  a method  to  find  lower  and  upper  bounds  to  the 
2 

c.d.f.  of  Tq  which  is  fairly  good  for  large  values  of  n,  and  they  ca-n  be  used  to 

-3  2 

set  a bound  for  0(n  ),  say,  in  the  asymptotic  expansion  of  the  c.d.f.  of  TQ. 

2 

We  shall  first  obtain  lower  and  upper  bounds  for  the  c.d.f.  of  T^.  It  is  well 
known  (e.g.  see  /”6_7)  that  the  joint  probability  law  of  the  characteristic  roots 

el,  e2'  •••>  of  m tr  under  the  null  hypothesis  HQ  is  give*  by 


(5.1)  P(e1,  e 


e ) = C(s,t,p,n) 

o 


t-s-1 
S — x— 

Ttei 

i=l  1 


e<  ' 

(1*^) 


nH-p 

2 


de 


s-1 

i TT  (ei" 
1 ioi 


ej)' 


where  0<e  <e  < ...  < e.  < co,  s-  min  (p,m),  t = max  (p,m)  and 

™*  S " 8 — X ™ " X 

r {'|(n+t-Pf  !)) 


s 

C(s,t,p,n)  = — ■ TT 


St 

2 


i=i  rj|(t-s+i)j  rji(n-pfi)J  T(|) 
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The  statistic  T^  is  expressed  as 


< 5.2) 


To  • " tr  Vo*  ' E,  V 


and  the  c.d.f.  of  TQ  is  given  by 


(5-3) 


t-B-1  rntn  s-l 

^ *ir  t 7 t 


F(2@^)  = C(e,t,p,n)  f • • • f TT  (1+  n ) ^ei  i<J=l^ei*eJ ^ 

J r*  ' i— 1 


where  R is  the  domain  of  integration  such  that  0 < e^  < e0  ^ < . . . < < oo  and 

s 

0 < £ e < 2©1 . Now  for  any  non-negative  values  of  e and  n,  the  following 

i-1  1 


inequality  holds i 


log  a*  s t 


for  i = 1,  s,  where  equality  holds  when  = O or  n > 00.  Hence  we  have 


nrt-n  iw-n  5 

7T  (1  + ^)"  2 > e~  2n  i<=1  6i 

i=l 


Therefore  the  probability  law  (5«1)  is  bounded  from  below  as  follows: 


(5.U) 


where 


Pl(ei>*»*>es)  < P(e1,...,eB) 


t-s-1 


?1(e  ,...,e  ) = C(s,t,p,n)  TT  e de  exp/--  — - L e 7 TT  (e,-e  ), 

i-1  211  1-1  1 i<J=l  1 J 


It  must  be  noted  here  that  P^(e^> . ..,es)  is  not  a probability  law  although  it  is 
non-negative  for  all  e±  such  that  0 < e0  < ...  < < 00.  Now  integrating  both 


sides  of  (5*^)  in  we  obtain 


(5.5) 


F1(2ei)  < F(2ei)  , 


where 
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t-8-1 


= C(s,t,p,n)  J ...  j Jfc±2  dei  exp^  " HE  1^1  Ci-^i^J=1  (ei"e^' 
R1 

and  also  integrating  both  sides  of  (5»^)  in  Rg  w^ere  0 < ea  < ...  < < oo  and 

s 

26  < 2 e < oo  and  subtracting  each  from  1,  we  have 
” 1=1  1 


(5-6) 


where 


F(2©1)  < F2(2©1), 


t-s-l 


S 8—1 


Po(26. ) = 1-C.(s,t,p,n)  ...  TT  e de  exp/"-  75—  2 e._7  TT  (e  - e ). 

J i‘l  1 1 i=l  i<J=l  J 

R2 

2 

In  order  to  evaluate  ^(2©^)  and  Fg(2©1),  we  observe  that  as  n tends  to  oo,  TQ 
« 2 

* 2 e has  a X distribution  with  st  degrees  of  freedom  in  the  limit,  i.e.,  we  have 

i*l  X 

g g g ^ 

(5-7)  K(s,t,p)  f...  ( TT  de  exp/"-  \ 2 e J TT  (e.-e  ) * G_  (©, ) , 

J DJ  i=l  1 1 * i=l  i<J=l  1 J P1  1 


where 


K(s,t,p)  = lim  G(s,t,p,n)  = — 

n >00  22"  TT  r [i(t-s+i)j  r(|) 


and  = st/2.  Hence  integration  of  (5.5)  yields 


(5-8) 


where 


F1(2©1)  - L(s,t,p,n)  G (^p  Q±) , 


L^-n)  - i{H5r  (=k  >2  ■ 


rtiiisei) 


Similarly  we  obtain  from  (5.6) 


(5-9) 


F2(2©1)  = 1 - L(s,t,p,n)  j^l  - Gp  (212  ©1)  j 


- 17  - 


Now  if  we  write  (4.3)  as 


(5.10) 


F(2S1) 


where  R^  is  the  error  committed  by  omitting  all  terms  except  the  first  three  terms 
in  the  asymptotic  series  of  F(2e^),  the  absolute  value  of  R^  has  the  following 
upper  bound: 


(5.H) 


|R3 1 < max  [|F1(2©1) 


fF2(2Gl) 


where  F^(2©1)  and  F2(2©1)  are  given  by  (5.8)  and  (5«9)>  respectively. 

2 

The  actual  manner  in  which  (3*33)  converges  to  the  true  value  or  (4.3)  to 

the  true  value  F(26^),  is  nov.  — ..own,  but  it  i6  hoped  that  the  use  of  the  first  few 

2 

corrective  terms  may  result  in  a test  which  is  more  accurate  than  the  X approxima- 
tion, at  any  rate  for  moderately  large  values  of  n.  In  the  case  of  the  asymptotic 

o 

formula  for  the  c.d.f.  of  Tq  (4.3)  we  may  Judge  the  magnitude  of  the  error  involved 
in  using  the  first  few  terms  of  the  series  by  (5. 11),  which  turns  out  to  be  rather 
small  numerically  when  n is  sufficiently  large. 

The  author  wishes  to  express  his  indebtedness  to  Professor  Harold  Hotelling  for 
suggesting  this  problem  and  for  his  guidance  in  the  preparation  of  this  paper. 
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